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T.  Summary  Of  The  Phase  I  Work 


The  research  objective  is  to  examine  process  discovery  methods  for  increasing  the 
productivity  of  materials  research  with  use  of  computational  intelligence  paradigms.  In 
particular,  studies  have  been  done  on  the  self-directed  control  of  growth  of  thin  film 
structures  with  Molecular  Beam  Epitaxy  (MBE)  and  Chemical  Vapor  Deposition  (CVD) 
processes. 

Phase  I  work  focused  on  the  study  of  procedures  and  enabling  technologies  to  achieve  the 
ultimate  goal.  The  overall  work  consists  of  four  important  tasks  as  follows  : 

1.  To  investigate  methods  for  on-line  parameter  interpretation  of  in-situ  process¬ 
monitoring  sensor  data; 

2.  To  investigate  neural-network  based  optimal  process  control; 

3.  To  illustrate  how  a  pattern  of  film  thickness  can  be  interpreted  to  generate  appropriate 
control  actions  for  correcting  non-uniformities  in  film  thickness; 

4.  To  investigate  a  memory  and  display  methodology,  called  Visual  Episodal 
Associative  Memory,  and  to  demonstrate  that  it  can  be  used  as  an  enabling 
technology  for  process  discovery. 

Task  1  was  related  to  process  modeling  and  monitoring.  Once  this  task  has  been 
achieved,  appropriate  control  actions  can  be  generated  based  on  an  estimation  of  the 
current  state  of  a  process.  In  task  2,  we  focused  our  study  on  a  feedforward/feedback 
optimal  control  methodology  with  use  of  neural  networks  and  evolutionary  programming. 
Task  3  was  an  illustration  how  a  pattern  of  film  thickness  estimated  from  task  1  could  be 
used  to  formulate  an  objective  function.  With  such  an  objective  function,  appropriate 
control  actions  can  be  generated  via  am  optimal  control  software.  Task  4  was  concerned 
with  development  of  an  intelligent  visualization  display  enabling  users  to  understand  and 
discover  the  process  behavior. 

II.  On-line  Parameter  Interpretation  Of  Sensor  Data. 

Survey  was  done  in  the  interpretation  of  optical  ellipsdmetry  measureihents  during 
growth  processes  of  multi-layer  thin  film  structures  of  semiconductor  riiaterials  with 
Molecular  Beam  Epitaxy  (MBE)  or  Chemical  Vapor  Deposition  (CVD)  processes.'  The 
mathematical  model  of  ellipsometry  is  baised  on  Fresnej,  equations,  that  allows  us  to 
determine  the  values  of  ellipsometry  measurements,  given  optical  characteristics  of  the 
film  and  substrate,  film  thickness,  wavelength  and  angle  of  incidence  [1].  The 
ellipsometry  measurements  are  two  angles  v  and  A,  which  define  the  ratio  p  of  the 
complex-amplitude  reflection  coefficients  Rp  and  for  the  p  and  ;s  polarizations.  The 

(1)  . 


corresponding  equations  are 


p  =  —  =  exp(7A)tanv|; 

R. 


1 


exp(yA)  tan  v|/  =  p(n, ,  ,  k^. ,  ,  A,,  (j)o ) 


(2) 


Equation  (2)  gives  a  direct  calculation  of  vj/  and  A  given  the  optical  characteristics  of  the 
substrate  (Uj,  kj),  of  the  film  (nf,  kf),  the  thickness  of  the  film  (df),  the  wavelength  (A)  and 
the  angle  of  incidence  (^q)-  However,  the  optical  charateristics  of  the  film  (nf,  kf)  , 
thickness  (df)  are  not  directly  measurable.  Only  the  values  of  ellipsometry  measurements 
(v|;  and  A)  can  be  measured  with  process  sensor.  The  interpretation  of  the  ellipsometry 
measurement  requires  the  inversion  of  the  Fresnel  equation.  Such  a  direct  inversion  is 
impossible.  However,  using  a  sequence  of  ellipsometry  measurements  as  indicated  in 
Figure  1,  it  is  possible  to  do  such  an  inversion  task.  Possible  methods  include  Newton’s 
methods  [2],  multiwavelength  (MW)  and  variable  angle-of-incidence  spectroscopic 
ellipsometry  (VASE)  techniques  [3-5],  multilayered  neural  networks  [6]  and  Functional 
Link  Nets  (FLN)  [7,8].  A  survey  of  current  works  indicated  that  Park’s  method  using 
FLN  is  efficient  and  capable  of  real-time  interpretation  [8].  The  added  advantage  of 
using  FLN  is  that  the  error  function  of  the  network  is  quadratic  and  so  efficient  learning 
methods  such  as  conjugate  learning  method  can  be  employed  and  global  optimization  is 
warranted  [9].  Park  divided  the  whole  region  of  interest  into  many  subregions,  and  a 
neural  network  is  used  to  learn  the  mapping  for  each  subregion.  Highly  accurate 
estimates  as  shown  in  Table  1  resulted  from  this  divide-and-conquer  method.  It  is  one  of 
the  best  current  methods  for  parameter  interpretation,  but  some  comments  on  Park’s 
current  implementation  need  to  be  made  here. 

•  In  Park’s  work,  a  huge  number  of  training  patterns  are  required  to  train  the  neural 
network  system  if  a  high  degree  of  accuracy  is  needed.  It  is  reasonable  because  of  the 
accuracy  of  a  neural  network  is  determined  by  the  training  set.  We  would  like  to 
mention  the  capability  of  AI  WARE’S  existing  CAD/CHEM  software.  It  seems  to  us 
that  the  optimization  capability  of  the  CAD/CHEM  software  can  indeed  be  utilized  to 
get  a  highly  accurate  optical  characteristics  and  film  thickness,  starting  with  initial 
values  estimated  from  a  trained  Functional  Link  Net.  With  use  of  CAD/CHEM,  we 
can  reduce  the  requirement  of  a  huge  training  set.  The  use  of  CAD/CHEM  for  the 
parameter  interpretation  will  be  tested  in  Phase  11. 
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Figure  1  Schematic  illustration  of  the  sequence  of  measurements 


Table  1  Actual  and  percentage  errors  in  estimating  film  properties  and  depths  using 
several  subregion  neural-nets  _ 


1  subregion 

errors 

3 
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actual 

percentage 

3:9 

actual 

percentage 

3:10 

actual 

percentage 

■ 

actual 

percentage 

actual 

percentage 


actual 

percentage 


actual 

percentage 


actual 

percentage 


0.01128 

0.268% 


0.00933 

0.222% 


0.00778 

0.185% 


0.00819 

0.195% 


0.00549 

0.129% 


0.00310 

0.073% 


0.00451 

0.106% 


0.00229 

0.054% 


0.00157 

0.544% 


0.00122 

0.422% 


0.00144 

0.499% 


0.00128 

0.453% 


0.00174 

0.610% 


0.00208 

0.761% 


0.00336 

1.178% 


0.00214 

0.780% 


0.097 

0.587% 


0.064 

0.396% 


0.073 

0.434% 


0.093 

0.566% 


0.345 

2.100% 


1.046 

6.285% 


0.150 

0.852% 


1.478 

9.218% 


0.083 

0.248% 


0.094 

0.277% 


0.094 

0.282% 


0.113 

0.331% 


0.371 

1.089% 


1.022 

3.035% 


0.229 

1.610% 


1.610 

4.796% 


0.119 

0.232% 


0.157 

0.308% 


0.130 

0.258% 


0.144 

0.285% 


0.444 

0.886% 


1.016 

1.994% 


0.189 

0.377% 


1.644 

3.256% 


*  in  the  subregions  of  (5:9),  (6:9)  and  (7:10),  the  properties  of  pseudosubstrate  approach  to  those  of  the 
homogenous  (infinite  thickness)  film  material  (inside  the  subregion  (7:10)).  This  is  why  the  percentage 
errors  of  the  depth  estimates  are  large  but  the  actual  errors  are  in  fact  quite  small. 


We  suggest  that  a  hybrid  method  using  a  forward  model  (Fresnel’s  equation)  and  an 
inverse  net  should  be  used.  The  outputs  of  the  inverse  net  provide  an  initial  guess  of 
optical  characteristics  and  thickness  of  the  film.  Such  an  initial  estimate  can  be  optimized 
using  CAD/CHEM  to  provide  a  more  accurate  result.  Using  such  a  .  hybrid  scheme,  a 


coarsely  trained  neural  network  may  be  sufficient  to  provide  initial  estimates  of  the 
optical  characteristics  and  thickness  of  the  film. 

TIT.  Self-Directed  Control  With  Use  Of  Feedforward/Feedback  Control 

Here,  we  will  discuss  our  study  on  the  use  of  feedforward  and  feedback  control 
methodologies.  In  general,  control  methodologies  can  be  classified  as  feedforward  (open- 
loop)  or  feedback  (closed-loop).  The  advantages  of  using  feedback  control  are  well 
understood  as  follows  : 

•  it  can  reduce  system  error; 

•  it  is  insensitive  to  external  disturbances  and  internal  variations  of  system  parameters. 

Since  a  feedback  system  is  insensitive  to  parameter  variation  and  noise,  it  is  possible  to 
control  a  process  in  a  fairly  good  manner  even  the  system  model  is  not  exact.  The  benefit 
of  using  feedback  control  methodology  is  that  we  don’t  need  a  very  accurate  system 
model.  However,  the  use  of  feedback  control  may  introduce  instability  problem,  that 
arise  when  the  system  tends  to  overcorrect  errors  that  may  cause  oscillation. 

Though  feedforward  (or  open-loop)  control  is  not  taken  as  an  advanced  control 
methodology  by  itselff  there  are  still  many  advantages  of  using  it,  namely  : 

•  stability  can  easily  be  assured; 

•  planning  a  sequence  of  control  actions  is  possible. 

Feedforward  control  is  very  sensitive  to  parameter  variations  and  external  disturbances, 
but  the  system  can  easily  be  assured  to  be  stable.  Feedback  control  systems,  on  the  other 
hand,  have  the  advantages  when  unpredictable  disturbances  and  unpredictable  variations 
of  system  parameters  occur.  Hence  a  proper  combination  of  feedforward  and  feedback 
control  is  desirable. 

AI  WARE’S  FLN  Control  Toolkit  is  a  version  of  feedforward  and  feedback  learning 
control  software.  The  block  diagram  is  shown  in  Figure  2.  It  combines  the  advantages  of 
feedforward  control,  feedback  control  and  the  learning  capability  of  neural  networks. 
Optimal  control  can  also  be  achieved  with  the  use  of  GESA  [10,11],  a  version  of 
evolutionary  programming.  Feedback  control  is  implemented  via  PID,  as  PID  is  quite  an 
industrial  standard.  The  use  of  feedforward  control  enables  us  to  tune  the  PID  gains  in 
the  optimal  manner  according  to  users’  objectives. 
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Figiffe  2.  A  version  of  optimal  control  implementation  with  use  of  optimization 


We  illustrate  the  benefit  of  optimal  control  with  use  of  FLN  Control  Toolkit  with  Figures 
3-5.  The  process  that  we  used  was  a  second-order  time-invariant  process  with  external 
disturbance  added  to  the  system  periodically.  As  illustrated  in  Figure  3,  PID  gains  were 
set  constant  during  the  whole  course  of  control  process,  the  system  was  a  standard 
feedback  control  system  and  was  able  to  return  back  to  the  set  point  slowly.  We  can  also 
minimize  a  user-defined  objective  as  illustrated  in  Figure  4,  and  Figure  5.  During 
optimal  control  tuning  process,  the  system  makes  use  of  the  open-loop  predictive 
capability  of  the  neural  network  model  to  determine  the  optimal  values  of  the  PID  gains 
so  as  to  achieve  the  user-defined  objective.  Figure  4  illustrate  the  responses  and  the 
control  actions  when  the  objective  was  to  minimize  the  sum  of  the  square  of  the  deviation 
from  the  set  point  and  the  square  of  the  control  actions.  Figure  5  illustrate  the  responses 
and  the  control  actions  when  the  objective  was  to  minimze  the  sum  of  the  square  of  the 
deviation  from  the  set  points  and  the  square  of  the  rate  of  change  of  control  actions.  Both 
Figures  4  and  5  indicate  that  the  system  returned  to  the  set  point  quickly  because  the 


deviation  from  the  set  point  was  part  of  the  objective.  The  control  action  was  smooth  in 
Figure  5  because  it  was  part  of  the  objective  as  well. 
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Figure  5  Optimal  system  response  with  optimal  PID  gains  tuned  with  GESA  to  minimize 
the  output  deviation  as  well  as  the  rate  of  change  of  control  input 


The  benefit  of  FLN  Control  Toolkit  can  be  summarized  as  follows  : 

•  it  is  a  learning  and  adaptive  control; 

•  it  is  a  combination  of  feedforward  and  feedback  control; 

•  optimal  control  can  be  achieved  through  the  use  of  objective  functions; 

•  objective  functions  can  be  dynamically  changed  to  suit  current  operating  conditions. 


IV.  Interpretation  Of  Film  Thickness  Data 


A  pie  shaped  aperture  correctly  positioned  over  a  rotating  substrate  would  yield  a  uniform 
film  (Fig.6-a).  This  is  not  the  case  if  the  aperture  is  misplaced  (Fig.6-b,  Fig.6-c). 
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(a )  Depiction  of  Pie-Shaped 
Aperture  Centered  Accurately  With 
Respect  to  Rotating  Aperture 


( b )  Depiction  of  Geometry  of 
Radially  Displaced  Aperture 


( c )  Depiction  of  Geometry  of 
Tilted  Aperture 


Figure  6  Geometry  of  Possible  Misalignments 


Misplacement  is  determined  by  the  two  parameters  bR,  linear  displacement,  and  60, 
angular  displacement,  as  shown  in  Fig.6-c.  The  value  of  the  angular  displacement  50  can 
be  restricted  to  satisfy  the  condition  O<50<0/2,  which  is  realistic  restriction.  The 
thickness  of  the  film  per  one  revolution  d{r)  at  any  point  of  the  circumference  of  radius 
r,  0<r<R  depends  on  the  value  of  the  time  which  the  point  spends,  traveling  inside  the 
aperture.  That  latter  value  is  proportional,  assuming  uniform  rotation  of  the  substrate,  to 
the  value  of  the  angle  (j)  =  (|),  +  (jjj .  Therefore, 

c/(r)  oc  <(),+<}), . 


The  simplest  case  and  perhaps  one  of  the  more  important  cases  is  that  when  that  the 
center  of  the  circle  of  radius  R  (center  of  rotation  of  the  substrate)  is  outside  of  the 
aperture.  For  that  circumstance  the  circumference  of  radius  r  intersects  both  of  the  linear 
sides  of  the  pie,  each  side  at  one  point.  It  is  obtained  that  the  thickness  would  vary  'with 
the  radial  position  r  in  the  following  manner. 


d{r) 


cc< 


0  -  arcsin 

‘  5  n 

—  sin(0  /  2  +  50) 

-  arcsin 

”  2  n 

— sin(0/2-50) 

< 

r 

r 

0, 

0<r<6R 

,  bR<r<R 


(3) 


Graphs  of  cl{r)  vs.  r  are  shown  in  Fig.  7  for  different  values  of  SR  and  50 . 


Expression  (3)  is  valid  for  all  cases  where  the  center  of  the  sunstrate  is  outside  of  the 
aperture.  Formulas  for  d{r)  for  cases  with  the  center  of  the  substrate  'within  the  aperture 
have  also  been  derived  and  are  exibited  in  Fig.  8  with  the  aid  of  some  explanatory 
illustrations  at  the  end  of  this  discussion. 


In  practice,  given  a  number  of  measurements  of  the  thicknesses  d^,d^,...df,  for  points 
situated  at  the  distances  from  the  center  the  substrate,  estimates  of  the  corrective 

control  actions  SR,SQ,  which  will  restore  the  aperture  to  the  correct  relative  position,  can 
be  obtained.  The  correct  control  actions  are  generated  by  minimizing  the  error  function 

F(5i?,60)  =  X[d;,-4r„)]  ,  (4) 

//=! 

where  d{rj,  n  =  i,...N  are  calculated  using  expressions  (3)  or  (5).  The  desired  values  SR 
and  60  are  those  which  bring  the  function  F  to  the  minimum.  Minimization  can  be  done 
using  different  efficient  stochastic  guided  procedures  such  as  GESA  [12]  or  SOFT  [13]. 
In  practice,  for  visualization  purposes,  three-dimensional  profiles  of  the  film  could  be 
displayed  for  successively  more  accurate  estimates  of  the  control  actions  SR  and  60  as 
control  progresses. 

For  the  case  when  the  center  of  the  substrate  is  inside  of  the  aperture  it  is  convenient  to 
divide  the  aperture  into  6  subregions  and  to  represent  (f)  as  the  sum  of  6  angles 
where  is  part  of  ^  lying  within  subregion  /,  /  =  1,...6  (See  Fig.  8). 

Values  of  R,...R^,e^,...B^  are  determined  by  the  relative  positions  of  the  aperture  and  the 
substrate  and  are  independent  of  the  variable  r.  Their  values  can  be  successively 
precalculated  by  the  following  formulas 
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R^=5R,  R,  =  R-5R,  6,  =6/2-56,  9;  =6/2  +  59,  = /?2 sin 6, , /?3  = 

/?4  =  -^R-  +  Rl  -2RRj cos6, , 

R^  =  ^R-  +  R^-2RR^cose^, 

R  R 

63  =  arcsin— ,  64  =6, +  63,  9^  =  arcsin-^,  65  =  6, +6^. 

The  thickness  is  given  by  the  following  formula 

4/-)  oc  5]<(). , 

l=t 


R2  sin  63 


where 


(j),  =  •<  arcsin  — 


0,  r>/?, 

— sin6,j-6|,  /?|<r</^, 
7c/2-6|,  0<r<Rj 


0,  /•>/?, 


(jjj  =  •  arcsin^-2-sin92  j-62,  Rj<r<R^ 
7i/2-6j,  Q<r<R. 


0,  r>R, 

([>3  =  •  arcsin^— sin  65  j,  Ry^r<  Ra  , 
71/2-63,  0<r<^3 


<1)4  =  1 64  -  arccos 


0,  r>R, 
'R^-r^-l^_' 

2rR,  J’ 

64,  Q<r<R, 


R^<r<R^  , 


4)5  =  1 65  -  arccos 


r>R^ 

R^-r^-R;^ 


,  R^<r<R^  , 


65,  0<r<R^ 


0,  r>R, 


(|)g  =  ■  arcsin^— sin9jj,  R,<r 
7i/2-9g,  0<r</?, 


R,<r<R,  . 
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Figure  8  Schematic  illustration  of  the  subregions 
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V.  Visual  EAM:  An  Intelligent  Visualization  Tool  For  Discovery 

Visual  Episodal  Associative  Memory  (Visual  EAM)  is  a  powerful  intelligent 
visualization  tool  developed  by  AI  WARE.  Though  it  is  not  so  directly  related  to  process 
control,  it  is  very  important  by  itself  in  the  context  of  discovery.  The  idea  of  Visual  EAM 
is  that  of  projecting  high-dimensional  objects  into  two-dimensional  space  in  a  manner 
that  meaningful  information  can  be  extracted  and  visualized.  In  this  report,  we  will 
describe  some  of  the  usages  of  Visual  EAM. 


Table  2  List  of  objects 


Original 

Attributes 

Transformed  Coordinates 

Label 

al 

a2 

a3 

a4 

a5 

X 

y 

A 

1 

0 

0 

0 

0 

0.999508 

0.999562 

B 

2 

0 

0 

0 

0 

0.999506 

0.999557 

C 

3 

0 

0 

0 

0 

0.999503 

0.999551 

D 

4 

0 

0 

0 

0 

0.999498 

0.999544 

E 

5 

0 

0 

0 

0 

0.999492 

0.999535 

F 

3 

1 

0 

0 

0 

0.714601 

0.648197 

G 

3 

2 

0 

0 

0 

0.469410 

0.354227 

H 

0 

0 

0 

0.455890 

0.271452 

I 

3 

4 

0 

0 

0 

0.972253 

0.060956 

J 

J 

5 

0 

0 

0 

0.999640 

0.001557 

K 

3 

J 

1 

0 

0 

0.388928 

0.224456 

L 

3 

3 

2 

0 

0 

0.337869 

0.182086 

M 

3 

3 

3 

0 

0 

0.298414 

0.143284 

N 

3 

J 

4 

0 

0 

0.257642 

0.109643 

0 

n 

J 

J 

5 

0 

0 

0.220664 

0.070379 

3 

3 

6 

0 

0 

0.201904 

0.040221 

3 

*> 

J 

7 

0 

0 

0.198844 

0.017906 

n 

J 

3 

8 

0 

0 

0.204662 

0.000990 

3 

3 

J 

1 

0 

0.218765 

0.177039 

3 

3 

J 

2 

0 

0.168311 

0.197174 

3 

n 

J 

3 

n 

J 

0 

0.116724 

0.215950 

3 

J 

3 

4 

0 

0.046178 

0.245005 

J 

3 

6 

1 

0 

0.143387 

0.083768 

X 

:> 

J 

6 

2 

0 

0.103514 

0.116506 

Y 

3 

J 

6 

3 

0 

0.067075 

0.147464 

Z 

3 

J 

6 

4 

0 

0.024474 

0.184802 

1 

3 

6 

2 

1 

0.084070 

0.095475 

2 

3 

3 

6 

2 

2 

0.066193 

0.076672 

J 

3 

3 

6 

2 

3 

0.050128 

0.059229 

4 

3 

n 

J 

6 

2 

4 

0.034308 

0.042277 

5 

3 

3 

6 

2 

5 

0.017574 

0.023258 

6 

3 

3 

6 

2 

6 

0.000572 

0.000668 
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Consider  a  list  of  objects  with  five  attributes  as  illustrated  in  Table  2  [14],  These  objects 
are  labeled  A, B,C,...., 1,2, 3,4, 5, 6.  There  is  no  special  meaning  attached  to  these  labels. 
Each  of  the  objects  is  described  in  terms  of  the  values  of  their  five  attributes.  These 
objects  can  be  transformed  nonlinearly  into  a  two-dimensional  space  using  Visual  EAM. 
It  is  clearly  impossible  to  display  all  the  information  contained  in  a  five-dimensional 
space  relationship  in  a  two-dimensional  display,  but  it  is  possible  to  devise  a  meaningful 
display,  curtailed  and  abstracted  but  nevertheless  convenient  and  suitable  for  many 
purposes.  A  display  obtained  with  Visual  EAM  is  shown  in  Figure  9. 


0  0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 


(a) 

0.999565 
0.99956 
0.999555 
0.99955 
0.999545 
0.99954 
0.999535 

0.99949  0.999495  0.9995  0.999505  0.99951 

(b) 

Figure  9  Two-dimensional  display  of  the  objects  obtained  with  Visual  EAM: 
(a)  in  normal  0  to  1  scale;  (b)  with  zoom-in  of  the  top  right  comer 
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It  is  seen  that  objects  which  are  close  in  five-dimensional  space  are  also  close  in  this  two 
dimensional  display.  The  objects  A,B,C,D,and  E  are  all  squashed  at  the  upper  right-hand 
comer  of  Figure  9.  However,  rooming  in  onto  that  part  of  the  display,  it  may  be  observed 
that  the  five  objects  are  apart  but  very  close  to  each  other.  Examinmg  the  values  of  the 
attributes  in  Table  2  shows  that  it  is  reasonable  and  informative. 

As  an  illustration  of  a  use  of  such  a  display  consider  the  task  of  exploring  the  question  of 
what  objects  have  high  values  in  the  third  attribute  and  those  which  have  high  values  in 
the  fourth  attribute.  The  intersection  of  the  two  separate  distributions  is  readily  observed 
as  illustrated  with  Figure  10.  This  is  a  simple  illustration  of  the  many  different  ways  in 
which  this  interesting  display  can  be  utilized. 


X0bectswitha3  =  6 
s  Objects  with  a4  =  4 


Figure  10  Illustration  of  the  usage  of  Visual  EAM  for  discovery  of  relationship  of  the 
attributes 

Investigation  also  has  been  done  on  the  use  of  Visual  EAM  for  categorization  and  rule 
discovery.  To  demonstrate  that,  we  used  a  gasoline  blending  task  [15]  as  an  example. 
The  set  of  the  training  data  is  shown  in  Table  1 .  The  task  of  the  blending  problem  is  to 
find  out  what  combinations  of  the  input  ingredients  provide  a  high  octane  amount  of  over 
100  (class  =  1).  It  has  been  demonstrated  by  CWRU  that  it  is  impossible  to  find  out  the 
proper  clusters  »sing  K-means  or  other  unsupervised  learning  methods.  We  used  a 
version  of  Visual  EAM  and  formd  out  that  the  two  classes  are  indeed  able  to  be  separated, 
as  illustrated  in  Figure  11.  The  high  octane  class  was  foirnd  to  locate  at  the  top-left 
comer  of  the  two-dimensional  map,  whereas  the  low  octane  class  was  located  at  the 
bottom-right  comer. 
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Table  3  List  of  training  set  for  gasoline  blending  task 


al 

(Butane) 

a2 

(Isopenetane) 

a3 

(Reformate) 

a4 

(CatCracked) 

a5 

(Alkylate) 

Class 

0 

0 

0.35 

0.6 

0.06 

1 

0 

0.3 

0.1 

0 

0.6 

1 

0.15 

0 

0.15 

0.6 

0.1 

0 

0 

0.3 

0.049 

0.6 

0.051 

0 

0.15 

0.127 

0.023 

0.6 

0.1 

0 

0 

0.158 

0.142 

0.1 

0.6 

1 

0.067 

0.098 

0.234 

0.332 

0.27 

1 

0 

0.126 

0.174 

0.6 

0.1 

0 

0.075 

0 

0.225 

0.6 

0.1 

0 

0 

0.3 

0 

0.1 

0.6 

1 

0.15 

0.15 

0.1 

0.6 

0 

0 

0 

0.3 

0 

0.489 

0.211 

0 

0.15 

0 

0.311 

0.539 

0 

0 

0 

0.3 

0.285 

0.415 

0 

0 

0 

0.08 

0.35 

0.57 

0 

1 

0.15 

0.15 

0.266 

0.434 

0 

0 

0.15 

0.15 

0.082 

0.018 

0.6 

1 

0 

0  ' 

0.3 

0.461 

0.239 

1 

0.15 

0.034 

0.116 

0.1 

0.6 

1 

0.068 

0.121 

0.175 

0.444 

0.192 

0 

0 

0.3 

0.192 

0.208 

0.3 

1 

0.15 

0.15 

0.174 

0.226 

0.3 

1 

0.075 

0.225 

0.276 

0.424 

0 

0 

0.075 

0.225 

0 

0.1 

0.6 

1 

0.15 

0.15 

0 

0.324 

0.376 

0 

0 

0.3 

0.192 

0.508 

0 

0 

1 

0.9 

0.8 

0.7 

0.6 

0.5 

0.4 

0.3 

0.2 

0.1 

0 


s - - — 

- - : - ^ 

• 

• 

— — m 

0.2 


0.4 


0.6 


0.8 


♦  Low  Octane  ' 
I  ■  High  Octane  I 


Figure  1 1  Illustration  of  separation  of  two  classes  with  use  of  Visual  EAM 
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To  further  discover  the  relationship  of  the  process,  we  pointed  to  the  memory  points  in 
the  map  and  the  original  input  ingredients  were  revealed.  Investigation  of  some  of  the 
memory  points  in  the  map  indicated  that  a5  (Alkylate)  is  high.  A  hypothesis  like  if 
Alkylate  is  high,  the  octane  should  be  high”  naturally  comes  to  our  mind.  This 
hypothesis  can  easily  be  verified  with  the  associative  memory  capability  of  the  Visual 
EAM.  We  used  the  association  mode  to  display  all  the  data  with  a5  >  4.  The  display  is 
shown  in  Figure  12.  It  is  clearly  that  the  hypothesis  is  true  for  the  training  set.  The 
negation  of  the  hypothesis  then  became  the  second  hypothesis.  That  is  “if  Alkylate  is 
low,  octane  level  should  be  low”.  Visual  EAM  displayed  the  memory  points  with  low 
Alkylate  (say  a5  <=  4)  as  shown  in  Figure  13.  Clearly  the  second  hypothesis  is  not  true. 
We  then  revealed  the  points  and  we  continued  the  steps,  we  got  the  following  qualitative 
rules: 

•  If  a5  is  high,  octane  level  is  high. 

•  If  a5  is  medium,  and  a3  is  high  and  a4  is  high,  octane  level  is  high. 


I  ■High  Octane 


Figure  12  Display  of  all  memory  points  with  a5  >  4 
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1 

0.9 

0.8 

0.7 

0.6 

0.5 

0.4 

0.3 

0.2 

0.1 

0 


*  ■■  ■ — - 

■ 

♦♦ 

♦  X 

- ^ 

0.2 


0.4 


0.6 


0.8 


♦  Low  Octane 
■  High  Octane 


Figure  13  Display  of  all  memory  points  with  a5  <=  4 

This  is  a  simple  illustration  of  a  man-machine  interaction  for  process  discovery. 
Hypothesis  and  questions  are  stimulated  and  generated  with  a  display  supported  by 
Visual  EAM.  These  hypothesis  can  be  verified  using  the  associative  memory  capability 
of  Visual  EAM.  Another  display  is  then  generated  and  may  stimulate  other  hypothesis. 
This  is  another  usage  of  Visual  EAM  for  rule  discovery. 

Another  comment  should  be  made  here  to  indicate  the  potential  of  Visual  EAM.  We 
demonstrated  one  of  many  different  procedures  for  man-human  interaction  for  rule 
discovery.  Since  it  is  a  simple  2-D  display,  we  are  confident  that  these  rule  discovery 
procedures  can  be  automated.  The  potential  integration  of  Visual  EAM  Avith  Air  Force’s 
Qualitative  Process  Automation  (QPA)  will  provide  a  powerful  tool  for  process  discovery 
and  automation. 


VI.  Conclusion 

Phase  I  has  investigated  the  necessary  components  for  process  discovery  and  procedures 
for  automation  to  increase  the  productivity  of  materials  research.  It  has  been  verified  that 
the  method  demonstrated  by  Park  is  efficient  and  capable  of  interpretation  of  ellipsometry 
measurements.  AI  WARE’S  FLN  control  toolkit  has  demonstrated  the  capability  of 
learning  a  model,  and  controlling  the  process  in  the  optimal  manners.  Analysis  has  been 
done  to  verify  that  misplacement  of  aperture  in  a  MBE  process  can  be  detected  and 
adjusted  back  to  the  proper  position  with  thickness  data.  The  development  of  Visual 
EAM  provides  a  powerful  tool  for  process  discovery  and  automation. 

In  conclusion.  Phase  I  has  demonstrated  that  process  discovery  and  automation  can  be 
done  -with  an  integration  of  advanced  technologies  in  neural  networks,  evolutionary 
programming,  learning  control,  and  intelligent  visualization  tools. 
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